Abstract. Feature selection algorithms can reduce the high dimensionality of textual cases and increase case-based task performance. However, conventional algorithms (e.g., information gain) are computationally expensive. We previously showed that, on one dataset, a rough set feature selection algorithm can reduce computational complexity without sacrificing task performance. Here we test the generality of our findings on additional feature selection algorithms, add one data set, and improve our empirical methodology. We observed that features of textual cases vary in their contribution to task performance based on their part-of-speech, and adapted the algorithms to include a part-of-speech bias as background knowledge. Our evaluation ...
A major obstacle that decreases the performance of text classifiers is the extremely high dimensiona...
Data reduction is an important step in knowledge discovery from data. The high dimensionality of dat...
Machine learning for text classification is the cornerstone of document categorization, news filteri...
AbstractThis paper presents a novel rough set-based case-based reasoner for use in text categorizati...
In this paper, we develop and analyze four algorithms for feature selection in the context of rough ...
Application of a feature selection algorithm to a textual data set can improve the performance of so...
This paper shows that psychological constraints on human information processing can be used effectiv...
Abstract—Huge number of documents are increasing rapidly, therefore, to organize it in digitized for...
Feature selection aims to determine a minimal feature subset from a problem domain while retaining a...
In this paper, we study the feature selection problem and develop and analyze four algorithms for fe...
Machine Learning techniques can be used to improve the performance of intelligent software systems. ...
Textual document set has become an important and rapidly growing information source in the web. Text...
Feature selection methods are often applied in the context of document classification. They are part...
Feature selection refers to the problem of selecting those input features that are most predictive...
The term “feature selection” refers to the problem of selecting the most predictive features for a g...
A major obstacle that decreases the performance of text classifiers is the extremely high dimensiona...
Data reduction is an important step in knowledge discovery from data. The high dimensionality of dat...
Machine learning for text classification is the cornerstone of document categorization, news filteri...
AbstractThis paper presents a novel rough set-based case-based reasoner for use in text categorizati...
In this paper, we develop and analyze four algorithms for feature selection in the context of rough ...
Application of a feature selection algorithm to a textual data set can improve the performance of so...
This paper shows that psychological constraints on human information processing can be used effectiv...
Abstract—Huge number of documents are increasing rapidly, therefore, to organize it in digitized for...
Feature selection aims to determine a minimal feature subset from a problem domain while retaining a...
In this paper, we study the feature selection problem and develop and analyze four algorithms for fe...
Machine Learning techniques can be used to improve the performance of intelligent software systems. ...
Textual document set has become an important and rapidly growing information source in the web. Text...
Feature selection methods are often applied in the context of document classification. They are part...
Feature selection refers to the problem of selecting those input features that are most predictive...
The term “feature selection” refers to the problem of selecting the most predictive features for a g...
A major obstacle that decreases the performance of text classifiers is the extremely high dimensiona...
Data reduction is an important step in knowledge discovery from data. The high dimensionality of dat...
Machine learning for text classification is the cornerstone of document categorization, news filteri...